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1 We characterise the typical offset between the Dark Matter (DM) projected cen- 

tre and the Brightest Cluster Galaxy (BCG) in 10,000 SDSS clusters. To place con- 
straints on the centre of DM, we use an automated strong-lensing (SL) analysis, mass- 
modelling technique which is based on the well-tested assumption that light traces 
mass. The cluster galaxies are modelled with a steep power-law, and the DM com- 
ponent is obtained by smoothing the galaxy distribution fitting a low-order 2D poly- 
nomial (via spline interpolation), while probing a whole range of polynomial degrees 
and galaxy power laws. We find that the offsets between the BCG and the peak of 
the smoothed light map representing the DM, A, are distributed equally around zero 
with no preferred direction, and are well described by a log-normal distribution with 
(log 10 (A [/i^Mpc])) = — 1.895±g;gg|, and a = 0.501 ± 0.004 (95% confidence levels), 
or (log 1? (A ["])) = 0.564±0.005, and a = 0.475±0.007. Some of the offsets originate in 
prior misidentifications of the BCG or other bright cluster members by the cluster find- 
ing algorithm, whose level we make an additional effort to assess, finding that ~ 10% 
of the clusters in the probed catalogue are likely to be misidentified, contributing to 
k> . higher-end offsets in general agreement with previous studies. Our results constitute 

the first statistically-significant high-resolution distributions of DM-to-BCG offsets 
obtained in an observational analysis, and importantly show that there exists such 
a typical non-zero offset, in the probed catalogue. The offsets show a weak positive 
correlation with redshift, so that higher separations are generally found for higher- z 
clusters in agreement with the hierarchical growth of structure, which in turn could 
potentially help characterise the merger, relaxation and evolution history of clusters, 
in future studies. In addition, the effective DM centre we adopt here, namely the peak 
of the smoothed light distribution representing the DM, can constitute a natural and 
alternative definition of cluster centers for optically-selected cluster catalogues. 

Key words: cosmology: observations, dark matter, galaxies: clusters: general, grav- 
itational lensing: strong, gravitational lensing: weak, mass function 



1 INTRODUCTION 

Galaxy clusters are the largest gravitationally-bound objects 
in the Universe, and are formed at later stages and relatively 



lower redshifts, in the hierarchical model. As such, galaxy 
clusters can shed light on the high end of the cosmic mass 
function and the evolution history of the Universe, and pro be 
the acceptable cosmological model (e.g. lAllen et al.ll201l[ ). 
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Digitised large sky surveys such as the Sloan Digital Sky 
Survey (SDSS; see lYork et e01|2000| : lAbazaiian et al.lbooa 
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2009), and increasing computational power, have driven in 
recent years statistical analyses of extensively large clus- 
ter samples. In these (mainly optical imaging) data, galaxy 
clusters are identified usually in an automated manner , 



via dedicated finding algorithms (e. g. Postma n et al 



Kepner et al.l 



Koester et al.l 



19991; 



Gal et al.l |2000|; iGladders fc Ye. 



2007b and references therein; Pierpaoli et al.l 



1996 



2000 



20 111 ). In addition, recent developments allow an indepen- 



dent detection of unprecedentedly large numbers of galaxy 
clusters in the X-ra y and Sunyaev-Z eldovich Effect (SZE) 



obser v ations (e.g. iBohringer et al.l |2003; Ebeling et al.l 
19981; IVanderlinde et all ]2010l; IWilliamson et al.l l201lF 



Planck Collaboration et al.ll2011ah . Very large cluster sam- 
ples have been used in many recent s tudies such as 
analyses of weak and strong lensing (e.g. Ijohnston et alj 
l2007al ; iMandelbaum et al.l 120081 ; IZitrin et al.l I2012T ), or to 
place constraint s on the cosmological parameters (e.g. 
iRozo et all l2010h and establish various scaling relations 



Rozo et al. 120091 ; IPlanck Collaboration et al.l l2011bl: 



iBauer et al.l 2012 ; see also Kravtsov fc Borgani 20121 ) asa 



few examples. 

One of the major factors of noise or systematic un- 
certainty in these studies, especially when the clusters are 
identified optically and when the analysis is dependent on 
a predetermined centre, such as a stacked lensing anal- 
ysis, c luster-background galaxy cross-correlation meas ure- 



ment (Johnston et al. 



also lOguri et al.ll2010l : 



2007bl; IMandelbaum et ail l20ld; see 



Umetsu et al. 2011al ). is the "misen- 



tring" of the BCG with respect to the dark matter (DM). 
Such offsets may result from either a misidentification of 
the cluster finding algorithm, or by a real measured offset 
between the projected DM centre and the BCG (see also 
I Johnston et al.lf2007bl ). and will ha ve a smoothing effect o n 
the, e.g., stacked lensing signal (see lOguri fc Takadall20l"lT ). 
A unique way to trace the offsets of the correctly identified 
clusters (hereafter, for simplicity, we dub these true, or real 
offsets to distinguish them from the misidentified clusters), 
to high accuracy, is by a strong-lensing (SL) analysis, where 
usually the multiple images in the core are used to accu- 
rately map the galaxies and dark mas s distributions (e.g 
IZitrin et al.ll2009bl ; lUmetsu et aI1l2011bl ). 

Due to the importance of the miscentring effect, there 
have been recent efforts to c haracterise its amplitu de and 
size distribution. For example. Ijohnston et al. (2007b) quan- 
tified this effect by using N-body simulation-based mock 
galaxy catalogues, and then run ning their cluster-find ing 
algorithm (maxBCG in that case; iKoester et aill2007bl ) to 
identify clusters in these catalogues, comparing the resulting 
BCG positions to the centres of the DM halos in the input 
simulations. They found that the offsets, i.e., for the clusters 
that are not centred on their BCG, are well described by a 
2D gaussianoftheformP(J? s ) = ff exp(-±(-^) 2 ), with R a 
being the magnitude of the offset and a — 0.42 h^ 1 Mpc, and 
that the effect they traced is indeed dominated by misidenti- 
fied BCGs. More recently Iffilbert fc White! (|2010h similarly 
found a = 0.34 - 0.41 ft" 1 Mpc (for different WMAP cos- 
mologies). Ijohnston et alj l|2007bl ) also found that the frac- 
tion of misidentified clusters decreases with richness: ~ 60% 
of the poorer (N ga i ~ 10) clusters are correctly identified, 
versus ~ 85% of the richer (N ga i ~ 100) clusters. Although 
using these numerical simulations one can estimate the level 
and distribution of misidentified clusters, in such procedures 



the galaxies are assigned to DM halos in the simulation to 
begin with, so that the true offset distribution, i.e. for clus- 
ters whose BCG was correctly identified, is hard to assess. 

Observationally, several studies also examined the off- 
sets between the BCG and the X-ray peak (or centroid), or 
recent ly, also with the SZ peak. For example, iLin fc Mohrl 
(2004) listed the BCG to X-ray peak of fsets for a few dozen 
Two Micron All-Sky Survey (2MASS, Ijarrett et ail |2000D 
clusters. They found, that about 75% of the identified clus- 
ters lay within 0.06r2oo, and 90% within 0.38r2oo, with a 
~ 10% contamination level of possibly misidentified BCGs. 
They suggested, that given these high fractions of small X- 
ray to BCG separations, the timescale for the BCG to sink 
to the cluster potential minimum may be short compared 
to the relaxation timescale of the intracluster gas. Or alter- 
natively, the scale of merger required to perturb the X-ray 
properties of the cluster could be smaller than the scale of 
the merger required to o ffset the BCG from the c luster cen- 
tre (|Lin fc Mohrl 120041 ). iMann fc Ebelind (|2012l ) examined 
the X-ray peak and centroid offsets from the BCG, in 108 
of the most luminous X-ray clusters, with the goal of con- 
straining the evolution with redshift of the cluster merger 
fraction, so that they also characterised the evolution of 
such offsets in redshift. They found that the distribution 
is roughly log-normal, and centred at 11.5 and 21.2 kpc for 
the offset of the BCG from the X-ray peak, and from the 
X-ray centroid, respectively. In addition, they found an evo- 
lution for these offsets with redshift, so that higher sepa- 
rations are generally expected for higher reds hift clusters, 
proba bly as a sign of higher merging fraction. ISehgal et al.l 
l|2012h recently examined the SZ signal for 474 optically- 
selected (maxBCG) clusters and 52 X-ray selected (MCXC) 
clusters using data from the Atacama Cosmology Telescope 
(ACT). For the optically-selected sample, they found that 
the Planck and ACT measurements could be explained if 
one assumed that the BCGs are offset from the SZ peaks 
uniformly between and 1.5 Mpc. However, they point out 
that other factors (rather than the BCG offset) could be 
in play in explaining the observed discrepancies, especially 
since for the X-ray-selected sample, a much narrower BCG 
to X-ray peak offset distributio n was found , peak ing within 
0.2 Mpc. In recent work, also, ISong et all (|2012l ) find that 
the BCG to SZ centroid offset distribution, in 146 South 
Pole Telescope selected clusters with BCGs well identified 
in follow-up optical and near-infrared observations, is simi- 
lar to that found previously in X-ray samples. 

lOguri et all (|2010l ) observationally examined the offset 
of the BCG from the centre of mass obtained in weak lens- 
ing (WL) analyses of a sample of 25 clusters. They found 
that the DM centre is overall consistent with that of the 
BCG (within 2a level), and that the observed distribution 
can be described by two components. The first, significant 
component describing the small offsets, is a 2D Gaussian 
with a — 0.09 h^ 1 Mpc, and the second less-significant 
component describing the t ail of larger separations, is fit- 
ted by the Ijohnston et al.l l|2007bl ) finding: a 2D Gaussian 
with a = 0.42 ft" 1 Mpc. They conclude d that about 1 0% o f 
their sample are BCG-offsetted clusters. lOguri et all (|2010l ) 
also probed the same effect by complementary X-ray fits for 
their sample, but found no specific correlation as the X-ray 
centroids are usua ll y more closely centred on the BCG (e.g. 
ILin fc Mohil |2004| ; [Maughan et al.l 120081 : IMann fc Ebelinel 
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I2012I , although see also IShan et al.l l2010al ) . They also note 
that the typical error on the mass centroid measurement 
in their WL analysis is ~ 50 h^ 1 kpc, and there are a 
few clusters in their sample that have errors larger than 
100 h' 1 kpc, which are non-negligible compared to the 
widths of the resulting 2D Gaussian distributions. In ad- 
dition, iDietrich et al.l (|2012h recently found, by thoroughly 
simulating WL observations, that generally, the magnitude 
of peak-offsets in WL maps could primarily be a direct re- 
sult of shape-measurement noise and smoothing of the WL 
mass maps. It is thus clear that for a more quantitative as- 
sessment of the distribution of BCG offsets, a much larger 
sample is nee ded, probed w i th suffi ciently high resolution. 

Recently, iGeorge et al.l l|2012h examined in 129 galaxy 
groups (halo masses of up to 10 14 M ) in the COSMOS 
field, the offset between the WL halo centre and other trac- 
ers such as the BCG, most massive group galaxy, or X-ray 
centroid. They found that the BCG is one of the best tracers 
of the centre-of-mass, and is offset typically by less than 75 
kpc from the (dark) halo centre. In addition, they concluded 
that similar offset distributions are highly susceptible to the 
centre definitions, i.e., centres defined following certain in- 
tensity centroids, can largely differ fro m those defined vi a 
the corresponding intensity peaks fsee lGeorge et al]|2012h . 
and that not accounting for the miscentring effect can cause 
a 5-30% bias in stacked WL analyses. The X-ray to BCG 
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Figure 1. Effect of the prior smoothing polynomial degree S on 
the posterior distribution of offsets between the BCG assigned by 
the GMBCG catalogue and the DM centre (for fixed q = 1.2). As 
higher S values entail usually steeper mass profiles, and have more 
degrees of freedom to describe the substructure, higher S values 
entail generally smaller BCG-DM offsets, converging towards the 
BCG near the boundary of our chosen range, S = 24. 



vious studies mentioned therein (e.f 


' Sheldon et all l200ll; 


iKoester et al. 2007a: Sanderson et al. 


20091. finding tvoicallv 



a fe w dozen kpc offsets or less). 

[Sh an et al ] (|2010af ) characterised the offsets between 
the X-ray p eaks and lensing centres in 38 clusters (see also 
lAllenlfl99a ). Although most clusters show small offsets, as is 
also usually seen in such lensing-analyses centre to BCG 
comparison (e.g. ISmith et all 120051 ; iRichard et all 120101 1. 
about 45% of their clusters, usually the merging, multiple- 
clump ones, show larger separations t han 10", with a maxi- 
mum of ~ 54" (or ~ 200 kpc, see also I For ero- Romero et all 
2010). This, however, may be a result of either large frac- 
tions of unrelaxed clusters in their sample (~ 60%), and, the 
ensemble of different SL techniques used for the compari- 
son, many of which pre-assume or iterate for the DM cen- 
tre while adopting a symmetric DM distribution (see refer- 
ences therein), which may be unrealistic given the perturbed 
and complex matter distribution seen especially in unrelaxed 
clusters. In that sense, such offsets or even the known dis- 
crep ancy between mass estimates fr om lensing and X-ray 
(e.g. lAllenlll99Sl; IRichard et alj|2010h . may not be surpris- 
ing (see lShan et al.ll2010bl). ~ 



In a recent work. 



Einasto et al.l (|2012T ) studied substruc- 



ture and multimodality in groups and clusters of galaxies in 
SDSS DR8, using several tests to characterise the member 
distribution. They found that the distribution of distances 
from the cluster (or component) centre for the correspond- 
ing brightest galaxies shows that most of these galaxies are 
located preferentially close to the (sub)cluster centre (typi- 
cally of an order of ~ 0.1 h" 1 Mpc scales, see Fig. 4 therein), 
although substantial numbers can show also higher separa- 
tions (typically of a ~ 1 h^ 1 Mpc scale). 

Here, we aim to characterise the typical offset between 
the DM peak and the BCG of a statistically-significant sam- 
ple, with a successful mass-modelling tool for SL analyses 
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Figure 2. Effect of the prior galaxy surface-density power-law q 
on the posterior distribution of offsets (for fixed S = 12). Com- 
pared to the effect of the polynomial degree seen in Figure [l] the 
effect of the galaxy power law q on the resulting distribution and 
the location of the peak, is negligible. 



ijZitrin et al.ll2009bf ) which does not require the DM centre 
to be predetermined, nor assumes a symmetric or any partic- 
ular pre-known shape for the DM distribution. This method 
was recently adapted for automated use on 10,000 SDSS 
clusters, deducing the first observational , universal distribu- 
tion of Einstein radii l|Zitrin et al.l 2012). The advantage of 
this approach in evaluating the BCG offsets is that the res- 
olution is very high (an order of 0.1"), and that the analysis 
can be performed blindly, on a very large sample. The results 
of this study will constitute the first statistically-significant, 
observational measure of this effect, and supply a comple- 
mentary measure to compare to the previous studies men- 
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Figure 3. 2D (posterior) distribution of offsets between the BCG 
assigned by the GMBCG catalogue and the DM centre. As can 
be seen and expected, the offsets are distributed equally around 
zero with no preferred direction. 



tioned above. In addition, the offset between the DM peak 
and the BCG may eventually help characterise the evolution 
of cluster galaxies and their host clusters, as more relaxed 
clusters can be anticipated to show decreasing or negligible 
offsets. 

We analyse here the same 10,000 clusters from 
IZitrin et all (|2012T ). which were randomly drawn from the 
Gaussian Mixt ure Brightest Cluster Galaxy (GMBCG; 
lHao et aLllioIoh SDSS cluster catalogue. In practice, these 
span the full redshift and richness ranges covered by the full 
catalog. The clusters had been found usin g the Error Cor - 
rected Gaussian Mixture Model algorithm l|Hao et alj|2009l ) 
to identify the BCG plus red sequence feature, convolving 
the identified red sequence galaxies with a spatial smoothing 
kernel to measure the clustering strength of galaxies (within 
0.5 Mpc) around BCGs. The technique was applied to the 
Data Release 7 of the Sloan Digital Sky Survey and produced 
a catalogue of over 55,000 rich galaxy clusters in the redshift 
range 0.1 < z < 0.55. The catalogue is approximately vol- 
ume limited up to redshift z ~ 0.4 and shows high purity 
and completeness when tested against a mock catalogue, 
and when compared to other well-established SD SS cluster 
catalogues such as maxBC G (jKoester et al.|[2007al ; for more 
details see lHao et al.ll2O10h . 

The paper is arranged as follows: in §2 we detail the 
method incorporated in order to obtain the BCG-DM off- 
sets. In §3 we report the results and factors of uncer- 
tainty, which are further discussed in §4 and concluded in 
§5. Throughout we use a standard ACDM cosmology with 
(fl m o = 0.3, J7ao = 0.7, h = 0.7), and distances are usually 
given in h" 1 Mpc, to ease the comparison to other works. 
To avoid possible confusion, we also note that all logarith- 
mic quantities and syntax in this work are in base 10, un- 
less stated otherwise, and are denoted equivalently as either 
"log 10 " or "Log". 



- N(A) raw 
N(A) with random measurment noise 
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Figure 4. Posterior distribution of offsets between the BCG as- 
signed by the GMBCG catalogue and the DM centre obtained as 
described in J2] The black curve shows the raw measurement, and 
the red curve includes 1 SDSS pixel (~ 0.4" /pix) random mea- 
surement noise, which we adopt hereafter for our analysis. The 
curve peaks at ~ 0.0042 h~ 1 Mpc (or 1.3"if binned in arcseconds 
rather than physical distance), and with a median of 0.0136 h~ 1 
Mpc. Following previous work, we tried to fit a 2D Gaussian to 
the distribution, although this yields a poor fit of R 2 ~ 0.7 at 
best. The curve, however, is very-well described by a log-normal 
distribution, as we discover here and show in Figure [5] 



2 METHOD 

Our analysis starts with the basic assumption that light 
traces mass. For each cluster, members were identified and 
listed in the GMBCG catalogue, following a photo-z, lumi- 
nosity, and distance filters, and the red-sequence definition 
in the g — r colour for zi < 0.43, or the r — i colour for 
zi > 0.43 (see lHao et alj|201ol . for more details). We assign 
listed members in the catalogue with a power-law surface 
density profile q, S(r) oc r~ q (where r is the radius from the 
galaxy), scaled by their luminosity, so that the superposition 
of all galaxy contributions in the cluster field represents the 
galaxy "lumpy" component of the mass model. The lumpy 
mass distribution is then smoothed to obtain the DM distri- 
bution, where the smoothing is performed by fitting a (rel- 
atively) low-order polynomial using 2D spline interpolation. 
The polynomial degree, S, determines in practice the posi- 
tion of the DM centre, or peak, so that to cover all options, 
we must take into account all reasonable polynomial degrees 
(as explained shortly), and to sample a wide-enough range 
of power laws to verify this choice does not have an effect on 
the final result. Throughout, the DM centre is simply taken 
as t he highest surface de nsity peak of the DM map (see also, 
e.g. iNavarro et alj|20ich . 

This a nalysis is based on t he lens-modelling method de- 
ve loped bvlZitrin et alj (l2009bl) , adopting the approach used 
in iBroadhurst et al.l ( 20051 ). where for our purpose here we 
do not need to obtain the final (scaled) mass distribution, 
but only to use the first two steps: generating the lumpy and 
smooth (unsealed) components of the mass model. Clearly, 
the overall relative scaling of each component, obtained usu- 
ally in a minimisation procedure using multiple-images, is 
irrelevant to the question of where the DM centre is, es- 
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Log(A [Mpc/h]) 

Figure 5. Posterior logarithmic distribution of DM-to-BCG off- 
sets. As is clearly seen, the offsets (green triangles) are well de- 
scribed by a log-normal distribution, with (log 10 (A [/i _1 Mpc])) = 
-1.898±o°°3, and a = 0.489 ± 0.004 (95% confidence levels; 
R? = 0.988). The fit is shown in a dash-dotted red line, done for 
all log 10 (A [ft^'Mpc]) < —1.11 offset values thus excluding the 
artefact caused by our SL analysing frame size at higher offsets. 
This artefact is then filtered out by a simple noise cleaning proce- 
dure described in A3. 21 and the result is overplotted in blue dots. 
The resulting cleaned distribution maintains the same log-normal 
shape, width, and position, as can be seen by the solid black curve, 
with (l°gio( A [ft _1 Mpc])) = -1.895+°'qq4, cr = 0.501 ± 0.004, 
and a slightly better R 2 of 0.992. This is fully consistent with 
the estimate by forcibly excluding outliers. Although similar, the 
latter fit, of the clean sample, is the final official result we adopt 
throughout. 



pecially since the galaxy component is known to comprise 
anyhow only a small fracti on of the total mass distribution 
(see also lZitrin et al.ll2012i ). 

Furthermore, we n ote that by perform ing the simple 
procedure described in IZitrin et all l |2009bh . namely, rep- 
resenting the red-sequence galaxies by a power-law surface 
density profile, smoothing it in order to obtain a repre- 
sentative DM component, and combining the two compo- 
nents with a reasonable and calibrated relative weight, we 
were able to immediately identify unprecedented numbers 
of multiple-images in many cluster fields. These multiple- 
images are physically found by this initial mass model, im- 
plying that this parametrisation is most successful in de- 
scribing the underlying (projected) mass distribution. When 
performing a detailed mass modelling using multiple-images, 
one goes over a representative range of q and S values, and 
minimises according to the constraints comprised by the 
multiple-image locations and redshifts. In practice, these two 
parameters are controlling the overall mass profile (steeper 
power-laws and higher polynomi al degrees usuall y entail 
steeper total mass profiles, see IZitrin et al.l l2009bl ). which 
is constrained using the angular-diameter distance to the 
different multiple systems. 

Here, we do not use multiple images as constraints. The 
large numbers of multiple- images found by the mass models 
constructed following the simple procedure described here, 
in previous clusters we analysed, use as an evidence and 
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Figure 6. Posterior logarithmic distribution of offsets, in arc- 
seconds, between the BCG and the DM centre. The offsets 
(green dots) are well described by a log-normal distribution, with 
(log 10 (A ["])) = 0.557 ± 0.004, and a = 0.501 ± 0.004 (95% con- 
fidence levels; R 2 = 0.995). The fit, shown in a dash- dotted red 
line, is performed while ignoring values below log 10 (A ["]) < 1.3, 
thus excluding the artefact caused by our SL analysing frame size 
at higher offsets. Similar to Figure [5] this artefact is filtered out 
by a simple noise cleaning procedure described in £13.21 and the 
result is overplotted in blue dots. The resulting cleaned distribu- 
tion maintains the same log-normal shape, width, and position: 
<log 10 (A ["])) = 0.564 ±0.005, a = 0.475 ±0.007, and R 2 = 0.99, 
clearly consistent with the result obtained prior to the cleaning 
procedure by excluding outliers. 

show that this method describes with unprecedented re- 
liability the underlying mass distribution. In other words, 
the ~ 30 cl usters we accurately analysed with this me thod 
to date (e.g. iBroadhurst et al 1 l2005l ; IZitrin et al.lboilal lblRl , 
show unambiguously that the q and S parameter space in 
the relevant range contains a representative solution for the 
cluster mass distribution, and hence, the DM centre. The 
polynomial degree, S, controls also the effective centre of 
the DM component, for a given power-law, q. Here, as men- 
tioned, since we do not use multiple-images as constraints, 
we go over a full range of polynomial degrees to account for 
all possible positions of the DM centre, probing also different 
power-law values to test their influence on the resulting DM 
peak position. The realistic ranges and values we adopt are 
[1 < q sC 1.5] with Aq = 0.1, and [4 sC s s: 24] with AS = 4, 
so that the q~ S grid comprises 36 measurements per clus- 
ter. This prior distribution we adopt is flat, so that each 
point on the q — S grid is equally weighted. This range also 
contains the parameter values obtained in the minimisation 
procedure when performing detailed SL analyses of many 
cluster lenses mentioned above. Moreover, a higher polyno- 
mial degree than the range we adopt here does not affect 
the DM centre significantly, i.e., the DM peak converges to- 
wards the BCG centre with higher polynomial degrees, so 
that above s ~ 20 the DM peak will usually approach the 
BCG, as there are enough degrees of freedom to describe 
smaller substructures (less smooth a fit). Not to create a 
bias towards smaller offsets, we therefore do not use smooth- 
ing degrees above s = 24. See Figures [T] and [2] for the effect 
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of the q and S parameters on the posterior distribution of 
BCG-DM offsets (this is discussed further in %£3j) . This in 
fact leads to the key point of our analysis: we sample the 
full range of possible DM centre positions per cluster, which 
are determined by the parametrisation we described, and 
entailed from the original galaxy distribution. In that sense, 
the DM centre is approximated by the weighted centre of lu- 
minosity, which is also affected therefore by the luminosity 
density and not only the brightest single member: clusters 
with evenly distributed galaxy luminosity around the BCG 
are more likely to show very small range of offsets regardless 
of the polynomial degree, while clusters with concentrated 
luminous and massive group(s) close to the BCG may show 
a wider range of possible DM-BCG offsets, strongly depend- 
ing on the polynomial degree. 

Throughout we work in a field-of-view of 120" x 120", 
since this is typically a large-enough size to describe the 
SL field. For example the lar gest Einstein radius known to 



date, see Zitrin et al. (|2009aT ) , is ~ 55" for redshift of z ~ 3 



(|Limousin et alj|2012f ). and since the critical curve is ap- 
proximately centred on the centre-of-mass, it is clearly not 
physically likely that the true offset would be significantly 
larger than this (except for, say, clusters with other galaxies 
which are indeed brighter than the massive galaxy sitting at, 
or close to, the potential minimum, see £|3. 2[) . Also, from pre- 
vious work we know that usually lensing clusters show up to 
a ~ 5" offset or less (although some may show a significantly 
higher offset, see below), as revealed i n detailed HST-based 



SL anal yses (e.g., Zitrin et al. 2009a b; Umctsu c t al.|[20"llbl . 
see also lSmith et al.ll2005l : lRichard et alj|20ich. This is indi - 
rectly also supported by the result of Johnston et alj |2007bl ) 
who found that misidentified clusters in a similar SDSS cat- 
alogue, show a 2D Gaussian distribution with a = 0.42/i _1 
Mpc, and that correctly identified clusters should comprise 
offsets much closer to the B CG (as seen also i n the other 
studie s mentioned in jjH e .g. lOguri et al.ll201ol : IShan et all 
l2010al : iGeorge et al] 2012), which we have taken upon to 
characterise here. In that context, it is worth mentioning 
that the method we apply here is capable of tracing also 
larger offsets than few arcseconds (if entailed by, e.g., the 
location of multiple images), and thus can probe a large 
enough offset range. One such example is the largest Einstein 
radius cluster mentioned above, Macs J0717. 5+3745, with 
an offset of ~ 30" measured from our model presented in 
IZitrin et alj |2009al ). However we also note, that were there 
correctly identified clusters with significantly larger offsets 
than we probe in our work (e.g., much further away from of 
the SL region or the FOV we work in), these might not be 
uncovered by our method, although we conclude from the 
above considerations, that these cannot be expected to be 
significant. 

The procedure implemented here can be thought of as 
simply smoothing the red sequence member light distribu- 
tion, in order to obtain the range of possible DM centres. As 
simple as it may sound, the success of the light-traces-mass 
assumption in constantly describing the mass distribution in 
many lensing clusters, and the large statistical sample which 
helps to reduce the statistical uncertainties, aid us to obtain 
a well-constrained solution for the distribution of BCG-DM 
offsets, as seen in the following section (SJS]). 



3 RESULTS AND UNCERTAINTIES 

In this section we present the results obtained by perform- 
ing the analysis described in assess the level of noise and 
uncertainty in them, and describe the noise-cleaning proce- 
dures taken. 



3.1 Offset Distribution 

We examine the 2D distribution of BCG-DM offsets ob- 
tained in our analysis. In Fig.[3l we show the 2D distribution 
of offsets for the 10,000 clusters analysed in this work. As 
can be seen and expected, the offsets are equally distributed 
around the centre in all directions, with no particular direc- 
tion preference. Our first step, is to try and fit a 2D Gaus- 



sian to thes e data, P(R a 



■ exp ( — | ( ^) 2 ), motivated b y 



the works of lJohnston et aL | (2007bl ) and lQguri etHI \20Vj} . 
where R a is the offset size (which we denote here throughout 
equivalently as A). Figure [4] shows the probability density, 
or the number of clusters per (radial) offset bin, where we 
add therein the same distribution obtained by taking into 
account random noise of blending due to the SDSS relatively 
large pixel scale, ~ OA" /pix, although its effect on the lo- 
cation of the peak is negligible as seen. The peak of the 
distribution is very small, 0.0042 h~ x Mpc, but significantly 
larger than zero, which is one of the important results of 
this work as we discuss below (^4}. 

No 2D Gaussian is capable of describing well the ob- 
served shape of the distribution. The best obtained 2D Gaus- 
sian fit has a poor goodness-of-fit estimator of R 2 ~ 0.7 at 
best. We also note, that we repeated the procedure in arcsec- 
ond bins rather than h' 1 Mpc bins. The resulting histogram 
peaks at 1.3", has the exact shape as the histogram seen in 
Figure [3] and the same inconsistency with a 2D Gaussian 
fit. 

We now remake the histogram, or probability dis- 
tribution of offsets, in logarithmic bins. The logarith- 
mic distributions are plotted in Figures [5] and [6] and 
are clearly well described by log-normal distributions (of 
the form /(log 10 (x)) oc exp( -('°sio W-j]°gio(*)» 2 ^ with 
(log 10 (A [ft _1 Mpc])) = -1.898±£;°o3, and a = 0.489±0.004 
(95% confidence levels; R 2 = 0.988), and (log 10 (A ["])) = 
0.557 ± 0.004, a = 0.501 ± 0.004, respectively. Outliers, de- 
fined as log 10 (A [ft^Mpc]) > -1.11, or log 10 (A ["]) > 1.3, 



are excluded from the fit, as we now describe ( £I3,2[) . 
3.2 Noise and Uncertainty 

In Figures [5] and [6] a significant secondary peak is seen at 
log 10 (A [ft^Mpc]) ~ -0.6, or log 10 (A ["]) ~ 1.8, which 
deviates from the log-normal shape that characterises the 
distribution at lower offsets. The origin of this peak is quite 
clear, as it forms around the edge of the field-of-view (FOV) 
size we use for our lens modelling, of 120" x 120", or an 
effective radius of ~ 60 — 85" (EJ, so that clusters ex- 
hibiting (unrealistic) offsets larger than this, are counted 
at that maximal offset range. This range translates into 
~ 0.078 - 0.11 h' 1 Mpc for the redshift range of the sam- 
ple, 0.1 < z < 0.55, or covering the logarithmic offset range 
of -1.11 < log 10 (A [h _1 Mpc]) < -0.42, exactly where the 
peak is seen. The first step we therefore perform, is compare 
the integral of the log-normal distribution fit to the main 
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peak in the range of the secondary peak, to the area enclosed 
below the data themselves. We obtain that the area enclosed 
by the data in that range, is ~ 2.5 times the area expected 
by the fit, implying a noise level of ~ 150% in that particular 
"problematic" range. We find that about 1250 clusters out 
of the 10,000 analysed contributed to this secondary peak 
(or close to 50,000 out of 360,000 individual measurements), 
where only ~ 500 are expected by the fit. Since true (rather 
than misidentified) clusters are not expected to have such 
large offsets, this fraction we adopt as the level of misidenti- 
fied clusters in the catalogue, roughly (1250 - 500)/10000 ~ 
8%. We also address the reader to Figure where we show 
different examples of clusters probed by our method, with 
a clear deviation for misidentified clusters. Clearly, there 
could be other misidentified clusters or BCGs overlooked by 
this procedure, that contribute to the smaller offset range 
inputting some a dditional noise to the offset distribution . 



Previous studies llLin & Mohrll2004l: 1 Johnston et al.1 


2007b: 


Koester et al. 2007al; lOguri et al. 20ld; lHao et al. 


I201C; 


Andreon & Morctti 201li; Skibba et al. l201ll; Georee et al. 


2012) showed that usually an order of ~ 10 — 30% 


of the 



clusters in such catalogues are misidentified, so that after 
accounting for the misidentified clusters which affect the 
high-end offsets, the remaining number of misidentified clus- 
ters in the sample can be expected to be reasonable (note 
also that some of the contribution of the same misidenti- 
fied clusters to the small offset range was automatically ex- 
cluded when filtering out problematic clusters as above). 
The overall level of misidentified clusters in the sample can 
be therefore expected to be around ~ 10 — 20%, or even 
up to a hi gher level of around ~ 30% following previous 
works (e.g. Ijohnston et al.ll2007bT ). Remaining discrepancy 
from the ~ 10% level we find, may be attributed to the 
amount of misidentified clusters that are not traced as such 
by our method, thus implying a possible ~ 10 — 20% (addi- 
tional) noise level on our results from such clusters. 

The second step, is to search whether there exists a 
certain population of clusters which contributes specifically 
to this artefact, and later, verify that by excluding these 
clusters, the log-normal fit to the true lower offsets, ob- 
tained in H3.ll remains consistent. We therefore examine 
various properties as a function of log 10 (A), looking for 
distinguishable populations. For example, we probed if the 
log 10 (A [h~ Mpc]) > —1.11 points originate from a certain 
redshift, richness, or relaxation-degree populations. No spe- 
cific distinguishable population (in terms of relative num- 
ber of misidentified clusters per criterion bin) is uncovered 
as a function of richness or redshift, and clusters from all 
redshift or richness bins contribute to the artefact (these 
are therefore not shown here). In order to examine the 
relation with relaxation degree, we examined by eye 300 
randomly selected clusters, 30 clusters in each of the ten 
log 10 (A [/i _1 Mpc]) = 0.5 wide bins (in the range [—5 ^ 
log 10 (A [/i _1 Mpc]) ^ 0]), where for each cluster we marked 
down whether it seems relaxed (e.g., if the mass/light is 
clearly concentrated around the BCG identified by the cat- 
alogue), or not necessarily. Figure [7] (upper panel) shows an 
example of three such distinctive cases. In addition, for each 
cluster probed by eye, we examine also the distribution of 
offsets in the q and S grid (Figure [7] lower panel). This, 
since the check by eye is highly subjective, and the shape 



of offsets on the q and S grid may withhold the information 
needed to filter this artefact more quantitatively. 

We noticed, as can be expected, that unrelaxed clus- 
ters tend to contribute more to the artefact than relaxed 
clusters, but that distinction is weak and subjective, and 
we do not make further use of it in our work. However, as 
suggested above, by performing this procedure, we found 
that indeed filtering based on the offset distribution per 
cluster, in the q and S grid, helps to remove most of the 
artefact noise. As can be seen in Figure [7] clusters of the 
third kind therein (right column) exhibit offsets which are 
significantly larger than well-identified clusters, all through- 
out the q and S plane. Since one of our assumptions is that 
higher q, but especially S values, should comprise, in gen- 
eral, smaller offsets converging towards the BCG, the fact 
that in these clusters there is no such trend and the whole 
plane suggests high separation, seems to represent well the 
misidentified portion of clusters (or BCGs). In practice, by 
several iterations, we found that the median offset per clus- 
ter in the 36-point q and S plane is a good measure for this 
deviation, which should help in the noise/artefact cleaning 
procedure. Excluding clusters with median(A)> 40" (or any 
other threshold value somewhat lower than this), preserves 
the shape and location of the original main peak, and gets 
rid of most of the noise: about 600 clusters out of the ~ 750 
noise clusters were filtered out. This means that the overall 
noise level is now down to 150/500 ~ 30% in that region. 
This noise level is low enough to now reinclude that noisy 
region in the fit, so the fit will now be obtained by the entire 
range, and the (remaining) noise level in the high-end region 
will play only an insignificant role. 

As a last verifying step, we analysed a sub-sample of 
a few dozen clusters chosen randomly from the offsetted, 
misidentified candidate clusters, with a larger FOV, to make 
sure the noise was indeed an artefact from the frame size. 
Most of these clusters, as expected, created now a peak 
around the new FOV size. We note also, that analysing the 
full sample with a larger FOV is too expensive computation- 
ally, and in fact due to the reasons we elaborated here, there 
is actually no need to do so (eventually that would just shift 
the artefact peak further away). 

In Figures [S] and |S] we replot the probability density, or 
logarithmic histogram of offsets, after excluding the clusters 
designated as misidentified using the above criterion. The 
secondary peak, as seen, almost vanishes, and the overall fit 
to the entire range, is (log 10 (A [/i _1 Mpc])) = -1.895^;^, 
a = 0.501 ± 0.004 and R 2 of 0.992 (or (log 10 (A ["])) = 
0.564 ± 0.005, a = 0.475 ± 0.007 and R 2 = 0.99 for the arc- 
second logarithmic distribution), fully consistent with the 
result obtained above excluding high-offsets form the fit. 
This verifies our noise cleaning procedure efficient and se- 
cure, and the result of this fit is the one adopted as the final 
shape of BCG-to-DM offsets found in this work. 

Due to the very large number of clusters analysed, sta- 
tistical uncertainties could in fact be expected to be very 
small. One of the possible factors of uncertainty, is the effect 
of the photo-z uncertainty on the distribution in physical 
scales (hT 1 Mpc, Figure [5}, and should not affect the arc- 
second distribution (Figure |5J, where there is no conversion 
to distance. The typical photo-z uncertainty for the sample 
BCGs is 0.015. Assuming this error is relatively symmet- 
ric around the real redshift, the overall statistics of offsets 
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Figure 7. Example of three different scenarios seen in a check by eye of 300 randomly selected clusters, that helped to filter out most of 
the noise of spurious BCG or cluster detections. For each scenario the upper panel shows the resulting DM distribution, and the lower 
panel shows the offset deduced for each cluster in the q and S space. Left: A typically relaxed cluster, with one main central halo and 
other smaller and less massive mini-halos. This cluster is correctly identified and the offset values are in the range physically expected for 
relaxed clusters. Centre: A less relaxed, probably merging cluster. This cluster shows more prominent subhalos close to the central BCG, 
and the values for the offsets in the q and S grid follow the expected trend, and although partially large - we consider such clusters, or 
offsets, possibly real and include them in our analysis. Right: A misidentified cluster. No prominent BCG halo is seen in the DM map, 
and, the offset values in the q-S plane do not follow the expected relation and lie all above a higher threshold. We therefore make use of 
this quality to filter out such clusters from our further analysis. For more details see i]3.2l 



cannot be expected to change significantly. To assess the 
level of uncertainty the photo- z error may contribute, we 
repeated the analysis over the full 10,000 cluster sample by 
photometric redshifts drawn randomly from a normal dis- 
tribution centred on the catalogue photometric redshift for 
each cluster, with a width of a = 0.015 (which is the photo- 2 
error quoted in Hao et al. 2010). We obtain that the distri- 
bution remains nearly unaffected: (log 10 (A [/i -1 Mpc])} and 
a change by less than 0.1%. 

Another factor of possible uncertainty is the portion 
of cluster galaxies correctly assigned by the cluster finding 
algorithm and used to construct the mass model. Since we 
work with the assumption that light traces mass, our method 
is strongly coupled to the input cluster members and their 
photometry. For example, some clusters may exhibit galax- 
ies which are brighter in practice than the massive galaxy 
sitting at, or close to, the cluster potential well. In such 
cases, the catalogue is likely to assign the brighter galaxy 
as the "central" one, and a large offset will be measured 
to the true, physical centre. In addition, were (other) mem- 
bers erroneously assigned to a cluster (including different 
structures along the line-of-sight), or true members went 
uncovered, this should have an effect on the resulting DM 
centre. Luckily, brighter members are more probable to be 
correctly assigned for a given cluster, which are those gov- 



erning the fit. In addition, the effect of the DM polynomial 
fit degree is the most affecting parameter (Figure [TJ, and 
since for each cluster we take into account a wide range of 
polynomial degrees resulting in a wide range of possible DM 
centres, the relative effect of misidentified cluster members 
should be comparably minor. To test this quantitatively we 
resampled 100 randomly chosen clusters from the catalogue, 
where for each cluster we input additional (artificial) cluster 
members in random locations in the frame, and compare the 
result to the analysis before including these. To construct a 
strong upper limit, for each cluster we input 50% additional 
members than the current number of members listed in the 
catalogue, with brightnesses (and masses) random up to that 
of the BCG. By doing so we deduce that this inclusion of up 
to 50% more members changes the offsets by typically, an 
order of 0.1" (and in any case less than 2.2"), per cluster, 
resulting in less than a 0.1% change on the mean and width 
of the distribution for these clusters. Overall 10,000 clusters, 
this effect is clearly negligible. 

In addition, we note that we examined the effect of the 
richness of the clusters, specifically the number of members 
in each cluster, on the distribut i on. Suc h a trend was found, 
for example, by I Johnston et a l. (2007b) for the significantly 
offsetted or misidentified clusters. We find no such noticeable 
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trend, so clusters from all richness ranges contribute equally 
to the true offsets. 

Although these may be hard to assess quantitatively, 
some additional contaminating factors should be at least ac- 
knowledged. For example, we note that per cluster, both the 
BCG and correspondingly, the DM centre deduced, originate 
from the same cluster catalogue. Therefore, were the true 
BCG not assigned to the cluster (often beca use its colours 
are b l uer that other red-s equence galaxies; e.g. jBildfell et all 
l200Sl ; |Pipino et al.ll201ll) . clearly it would be centred on the 
wrong bright galaxy which we consider accordingly to be 
the BCG. This would render the offsets for this cluster non- 
credible. The level of such "bluer" BCGs which may b e un- 
covered was previously assessed by IPipirio et al.1 (|201ll l. for 
example, to be only ~ 8 per cent in a 69,000 SDSS cluster 
catalogue, and so its effect on the whole sample should be 
of the same order. Inte restingly, in a smalle r sample of 48 
X-ray luminous clusters. iBildfell et al.l (|2008h found a higher 
percentage of bluer-core BCGs (~ 25%), and noted that in 
these clusters, the BCG lies within ~ 10 kpc of the X-ray 
peak. 

It should be stressed, in addition, that we do not at- 
tempt to deduce one single offset measurement per cluster, 
but for each cluster expand our full parameters space to 
include a full range of possible offsets (coupled to our prior 
distribution based on the success of this modelling method) . 
The high number of clusters analysed in this manner allow 
us to deduce the credible posterior distribution of offsets 
presented here, given the prior choice of parameters range, 
as we now discuss. 

3.3 Systematic Uncertainty from Priors 

In the previous subsection, we discussed some (statistical 
and systematic) uncertainties relevant for our result. Here, 
we now aim to discuss the effect of prior choice of parameters 
on the resulting distributions. As seen in Figures [T] and [5] 
the resulting posterior distribution depends insignificantly 
on the prior distributions adopted for the q parameter, but 
strongly depends on the S parameter. Since fits are not con- 
strained by multiple images, for each cluster one does not 
know what the best-fitting q and S values would be (which 
would result in a single offset value per cluster). The only 
way to quantify related uncertainties is to probe a broad 
range of q and S parameter values per cluster. We choose 
q 6 [1, 1.5] and S £ [4,24] (see and sample this param- 
eter range at 36 equidistant points per cluster. These sam- 
ples are used to construct the 360,000 point histograms seen 
throughout. This amounts to Monte-Carlo sampling with 
wide, flat priors for q and S and takes into account the large 
uncertainties arising from the possible range of parameters. 
The resulting offset distribution is convolved with the (in- 
ferred) intrinsic distribution with the posterior distribution 
corresponding to Figures Q] and [5] Flat priors are chosen pri- 
marily in order not to bias the results by the lack of knowl- 
edge on how the q and 5* parameters should be distributed 
in practice. 

The first step in assessing the resulting level of system- 
atic uncertainty is therefore simply to examine Figures [T] 
and[5] The boundary cases (e.g., between S = 4 and S = 24) 
comprise a maximal systematic uncertainty of (the order of) 
log 10 (A [/i _1 Mpc]) = -0.45, for example, about ~ 24% of 



the obtained mean offset ((log 10 (A [ft _1 Mpc])) = -1.895). 
However, a more realistic estimation of the level of system- 
atic uncertainty may come from incorporating more specific 
prior distributions for the q and S parameters. Since we 
have analysed about 30 clusters with our method previously, 
we can make use of the q and S distribution among these 
clusters to construct more sophistically guessed priors. Al- 
though these are not shown here, they entail a distribution 
of, crudely, mean of q ~ 1.25 and S ~ 10, with a q ~ 0.15 
and as — 4. We use now these priors to generate a poste- 
rior distribution to compare with the posterior distribution 
obtained by initially incorporating flat priors. By doing so 
we obtain a systematic uncertainty of ~ 15% on the log- 
normal mean, and ~ 10% on the width of the (log-normal) 
distribution. 

Throughout, however, we adopt and work with the re- 
sult of the flat priors, since the allegedly more specific pri- 
ors are based on previous analyses of HST images. One has 
to construct a (larger) sample of the SDSS clusters well- 
constrained with multiple images (e.g . iHennawi et al.ll2008l ; 
iBavliss et ai]|201ll ; lOguri et al.ll2012l ). in order to securely 
adopt an interior, prior distribution for q and 5*. Until an 
explicit analysis of such a sample with our method is made 
available, we therefore adopt the more conservative, flat 
prior distribution results. 



4 DISCUSSION 

Although only relatively little work was conducted on the 
subject, the DM to BCG offsets have been increasingly ex- 
amined in recent years, due to growing technical and ob- 
servational capabilities enabling statistical studies of large 
samples, such as stacked WL analyses, and since a knowl- 
edge of the DM-BCG typical offset plays an important role 
in them. Here we further discuss the results of our work and 
other possible effects of uncertainty. 

In |3] and Figures [5] and [6] we presented the measured 
distribution of BCG-DM offsets. An important question that 
arises given the measured distribution, is whether the BCGs 
are on average centred in their cluster potentials (described 
by the smoothed light distribution) with some finite devia- 
tion, or whether there is any evidence for a significant off- 
set. To address this question, we perform a student-t test, 
which tests the hypothesis that a set of random variables 
is drawn from a Gaussian with either known or unknown 
mean and variance. The specific hypothesis tested in our 
case is whether the measured offsets can be considered as 
drawn from a Gaussian with zero mean and arbitrary vari- 
ance. Suppose the projected BCG coordinates relative to 
their cluster centres are indeed random variates drawn from 
a two-dimensional Gaussian distribution, then their radial 
distances to the cluster centres, which we have measured 
here, are clearly not distributed in a Gaussian way. However, 
we can then return the radial distances into a set of Gaussian 
random variates by multiplying with the sine or cosine of a 
random phase with a flat distribution between [0, 2ir] . This 
multiplication with a random phase removes the geometrical 
effect that the differential area shrinks proportional to the 
distance from the centre, which makes radial distributions 
peak at finite radii. This corresponds to projecting the radial 
distance on an arbitrarily oriented coordinate axis. Thus, if 
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Figure 8. Offset histograms as a function of redshift. For each bin we denote the number of data points, Ntot, and the mean and width 
of the distribution, fi and cr, respectively. Although insignificant given the distribution width in each bin, a steady trend is seen in the 
mean of the sample, where higher redshift clusters show higher BCG-DM separations, and generally, lower widths of the distribution 
(although the high-end of the last three bins may be affect by the size limit of our analysing frame). See also Figure fTOl 



our hypothesis is true, the radial distances multiplied with 
the sine or cosine of a random phase angle are Gaussian 
variates {xi} with zero mean. To this set of numbers, the 
student-t test can now be directly applied. 
Its normalised test statistic, 



T(x) = 



Nx 



s{x) 



(1) 



where x and s(x) are the mean and the standard deviation 
of the set {xi}, follows a student-t distribution with N — 1 
degrees of freedom if the hypothesis is true. For our sam- 
ple, we clearly have to reject this hypothesis. While the set 
{xi} can be well described by a Gaussian distribution, this 
distribution does not have zero mean. We obtain that the 
absolute t-statistics is 0.1(0.19), with an error probability of 
0.42(0.46), that the linear(logarithmic) distribution is drawn 
from a Gaussian with a zero mean. We therefore conclude 
that although relatively small, the typical offset is signifi- 
cantly, non-zero. 

In Figures [8] and [9] we plotted the offset distribution in 
different redshift bins. The evolution of the mean and width 
of the (log-normal) distributions as a function of bin redshift 
is summarised in Figures [TOl and [TT1 As can be seen therein, 



the mean of both the (log-normal) distribution in physical 
scales and in angular scales, increases steadily with redshift. 
The observed evolution, is however not significant: it is of the 
order of la for physical scale distribution, and only half a a 
for the angular scale offset distribution. This low significance 
renders these trends at best, tentative. We note, however, 
that in a recent work (|Zitrin et al.ll2012T ). in which we anal- 
ysed the Einstein radius distribution of the same GMBCG 
catalogue, we also uncovered similar (insignificant but mono- 
tonic) evolvement in redshift: throughout the same (volume- 
limited) redshift range, the mean Einstein radius decreases 
continuously with redshift. If real, these evolvements in red- 
shift, in both the Einstein radius and the DM-BCG offset 
distributions, could together help characterise the evolution, 
relaxation, and merger history of galaxy clusters more gen- 
erally, in addition to other complementary studies. 

As mentioned (HJ, lOguri et al.l (|2010l ) observationally 
examined the offset of the BCG from the centre of mass 
obtained in weak lensing (WL) analyses of a sample of 25 
clusters. They found that the DM centre is overall consis- 
tent with that of the BCG (within 2er level), and that the 
observed distribution can be described by two components. 
The first, significant component describing the small offsets, 
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Figure 9. Same as Figurc[8] but for distributions in angular logarithmic scales. See also Figure [TT1 



is a 2D Gaussian with a — 0.09 ft -1 Mpc, and the sec- 
ond less-significant compone nt describing the tail of larger 
separations, is fitted by the Ijohnston et al.l (|2007bh find- 
ing (see also iHilbert fe Whita 2010l ): a 2D Gaussian with 
a — 0.42 h^ 1 Mpc. Note that in our work we do not charac- 
terise the offset distribution of the misidentified clusters, but 
only the correctly identified ones. In addition, we can not ex- 
plicitly compare to the results of lOguri et al.l (|2010T ). since 
in their work the centres are those determined by fitting 
a symmetric DM distribution (e.g., NFW) to the overall, 
larger-scale WL data (this in fact was recentl y found to be 
prob lematic especially in merging clusters, see lGeorge et al.l 
120121 ). while in our work we simply measure the range of 
possible locations of the central DM peak. 

IShan et all (|2010al ) characterised the offsets between 
the X-ray p eaks and lensing centres in 38 clusters (see also 
lAllen|[l99ct ). Although most clusters show small offsets, as is 
also usual ly seen in such lensing-centre to BCG compar- 
ison (e.g. ISmith et alJ 120051 : iRichard et al.l l2010l ) showing 
typically less than 5" offsets compatible with our results 
here, about 45% of their clusters and especially the merg- 
ing, multiple-clump ones, show larger separations than 10", 
with a maximum of ~ 54". This, however, may be a result 



of either large fractions of unrelaxed clusters in their sample 
(~ 60%), and, the ensemble of different SL techniques used 
for the comparison, many of which pre-assume or iterate for 
the DM centre while adopting a symmetric DM distribution 
(see references therein), which may be unrealistic given the 
perturbed and complex matter distribution seen especially 
in unrelaxed clusters. In that sense, such offsets or even the 
known disc repancy between mass estimates from lensing and 
X-ray Ce.g. lAllenlll998!:|Richard et al.ll2o"i"oT ). may not be sur- 
prising (see lShan et al. 2010b). This in fact is a crucial point 
to make here, as we implied above: In our work we do not 
fit to the data a symmetric DM distribution for which the 
effective DM centre may be in practice different than the 
DM peak, whose offset from the BCG we characterise here. 

Several studies have also examined the off sets between 
the BCG and the X-ray peak or centroid (e.g. iLin fe Mohi 
12004 iMaughan et all 120081 ). Recently, iMann fe Ebeline 
( 20121 ) examined the X-ray peak and centroid offsets from 
the BCG, in 108 of the most luminous X-ray clusters, with 
the goal of constraining the evolution with redshift of the 
cluster merger fraction, so that they also characterised the 
evolution of such offsets, with redshift. Similar to our result, 
they also found that the distribution is (roughly) log-normal, 
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and centred at 11.5 kpc (Ho = 70 km s _1 Mpc" 1 ) for the 
offset of the BCG from the X-ray peak, overall similar to, or 
of the same order of, the peak centre we find for our BCG- 
DM offset distribution: 12.7 ft" 1 kpc. For the BCG offset 
from the X-ray centroid, their peak is ~twice as large. In 
addition, they found an evolution for these offsets with red- 
shift, so that higher separations are generally expected for 
higher redshift clusters, probably as a sign of higher merging 
fraction, similar to the (tentaive) evolution we observe here 
between the DM peak and the BCG. This evolution gener- 
ally agrees also with other complementary studies such as 
(a different X-ray sample) brightness centroid shifts, metal- 
licity, brightness profile steepness, or other similar relations 
found in cluster evolution works (e.g. iMaughan et alj|2008t 
and references therein). 

A similarly interesting question which should be inves- 
tigated quantitatively, is the correlation between the offset 
magnitude and the degree of relaxation. While in our work 
the BCG to DM peak offset may imply, generally, the de- 
gree of relaxation, other independent relaxation measures 
would be needed in order to obtain at least a reference 
sample for one to find a correlation between these. In a 
similar manner, the degree of relaxation have been defined 
by various (simulat i on-based) works in the literature (e.g. 
iThomas et~aT1l200ll ; iNeto et al.ll2007f ). where the offset be- 
tween the potential minimum and the centre of mass, con- 
stitut es one of the measures for relaxation fsee lLemze et al] 
120121 ). however these require knowledge of both. It would 
be therefore worth developing other independent relaxation 
measures to compare with, in future studies. 

In a similar context, and although in our previous anal- 
yses we have successfully covered large ranges of these pa- 
rameters, once many more clusters are analysed in detail 
using multiple images, in combination also with WL data 
where possible, it would be interesting to test whether the 
procedure described here still applies to new limits of rich- 
ness, luminosity, relaxation, concentration, mass, et cetera. 

We also note, that an alternative smoothing process 
to consider, could be based on a Gaussian filter smooth- 
ing rather a fit to a 2D polynomial as performed here, since 
the latter could suffer from various edge effects and other 
boundary condition artefacts (although, importantly, with 
no noticeable effect on the central, DM peak location; the 
length scale is an order of magnitude different). The reason 
we did not use a Gaussian smoothing, but the 2D spline 
interpolation, is that the latter method is well vetted and 
works remarkably well in reconstructing many mass distri- 
butions, as seen in our previous lensing analyses mentioned 
throughout. We are, however, in the process of examining 
the Gaussian smoothing alternative, with the goal of estab- 
lishing the range of relevant Gaussian-filter widths to be 
implemented (e.g. as priors) in the future analyses, for com- 
parison. 

In our work here we made an additional attempt to 
identify the level of misidentified and grossly miscentred 
clusters. We found that substantial part of the misidentified 
clusters show a typical offset behaviour in the q-S param- 
eter plane, of which we made use to exclude such clusters 
from the fit and estimate their portion in the whole sam- 
ple. Accordingly, we found that about 10% of the clusters 
in the catalogue may be misidentified, which is smaller but 
of the same general order as found in the previous studies 



-1 



< 





y- 1.94*x-2. 
y = - 2.95 - x 2 + 


>4 

3.93*x - 


2.93 












1.5 




















-2 






















2.5 












-3 









0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.55 0.6 



Figure 10. Growth of the log-normal distribution mean, 
(log 1 Q(A[/i _1 Mpc])) (open circles), and width, a (vertical error 
bars), as a function of redshift. The horizontal error bars repre- 
sent the redshift bin width. Although it is insignificant, i.e. of the 
order of la, a tentative trend is seen as a function of redshift, 
so that higher separations and lower widths are seen for higher- z 
clusters. The linear and quadratic least-squares fit to the data arc 
given in the upper left corner, and overplotted as dash-dotted red 
and black lines, respectively. 
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Figure 11. Same as Figure 1101 but for the log-normal distri- 
bution in arcsecond binning. This Figure, along with Figure [TOl 
shows that there exists a tentative (of the order of only half a 
a here) but steady trend possibly originating from the evolution 
and merger history of clusters, so that higher redshift and there- 
fore less relaxed clusters tend to show (both physical-distance and 
angular) higher separations, and lower (log-normal) distribution 
widths. One way to interpret the minor distribution-width trend, 
is that higher redshift clusters exhibit a somewhat more coherent 
population, say, in terms of relaxation, than lower redshift clus- 
ters that show a wider variety of relaxed and unrelaxed clusters. 
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mentioned throughout. In that sense, also, note that we do 
not attempt to characterise the offset distribution of these 
misidentified clusters, nor can we distinguish them from real, 
grossly miscenetred clusters, if these indeed exist and sur- 
vived our cleaning procedure. Both by the offset log-normal 
shape we find here which drops for higher offsets, and the 
difference between the 10% noise level we find and the level 
found in other studies, the effect of both largely offsetted and 
misidentified clusters in the remaining sample (after clean- 
ing the misidentified clusters we spotted in our procedure) 
on our results, is expected to be minimal (see also i]3.2l) . 



5 SUMMARY 

In this work we investigated the distribution of BCG to DM 
centre offsets, with an automated mass-modelling method 
(usually used for lensing analyses), based on the well-tested 
assumption that light traces mass. The modelling includes 
a lumpy component representing the cluster galaxies, and a 
smooth component (of the galaxy distribution) representing 
the DM. The success of this assumption in describing the 
mass distributions of lensing clusters assures us that the 
mass model is able to determine the effective DM peak or 
centre, that can be then compared with the position of the 
BCG. Unlike typical lensing analyses, we do not work here 
with multiple images to constrain the fit, and so we take into 
account a full range of smoothing (polynomial fit) degrees 
as a flat prior, to cover all options of offsets per cluster. 

Few previous studies have dealt with the DM to 
BCG offset, although these were either performed using 
numerical simulations mock catalogues probing the high- 
end of the misidentified o r significantly miscentr ed clus- 
ters (| Johnston et al.ll2007bl ; Iffilbert fc White! |2010| ) finding 
a 2D Gaussian width of a = 0.34 - 0.42 h' 1 M pc, or by 
a WL analysis of 25 clusters l|Oguri et al.ll2010l . 2D Gaus- 
sian width of a — 0.09 hT 1 Mpc, and a secondary tail with 
a — 0.42 hT 1 Mpc), which unfortunately entails a relatively 
low resolution to co mpare to our results. Recent work by 
iGeorge et all (|2012l ). establishes in 129 galaxy groups (al- 
though as massive as < 10 14 Mq) that the typical WL halo 
centre to BCG offset is smaller, around 75 kpc. In addition, 
BCG to X-ray (or SZ) peak and centroid offsets have been 



also characterised (e.g . Lin & Mohr 2004; Maugha n et al 
20081 ; iMann fc Ebelineil2012l ; ISehgal et aljboifl ISong et a! 



2012J), as well as BCG or X-ray, to SL centre offsets previ- 



ously derived for small samples of clusters (e .g. ISmith et al.l 
l2005l ; lRichard et al.|[201(il ; IShan et al.ll2010bh . Here, we have 
taken upon to concentrate and characterise the projected 
offset distribution with a high-resolution analysis, incorpo- 
rating a mass-modelling method used for detailed SL analy- 
ses, and performed over an unprecede ntedly large samp le of 
10,000 SDSS clusters drawn from the lHao et alT (|201Cj ) cat- 
alogue. Such an automated study, in addition to charcter- 
ising the offset distribution which is of its own importance, 
could help identify highly perturbed, potentially merging 
'bullet-like" clusters and to statistically constrain the DM 
cross-section, especially w hen complementary X-ray obser- 
vations are in hand (e.g . IClowe et al.l 120061 ; iBradac et al.l 
l200Sl ; lMerten et alj|201lf) . 

The first of the two main results of this work, is there- 
fore the offset distribution (Figures [5] and [6]), and the 



fact it has a non-zero peak. The offsets are well described 
by a log-normal distribution, with (log 10 (A [h, _1 Mpc])} = 
— 1.895±^;2oI, and a = 0.501 ± 0.004 (95% confidence lev- 
els), or (log 10 (A["])} = 0.564 ±0.005, and a = 0.475 ±0.007. 
Here the mean log offsets correspond to A ~ 12.7 ft~ 1 kpc, 
and A ~ 3.6", respectively. Note tha t these offsets are signif - 
ica ntly smaller than th ose probed bv lJohnston et aL (2007b) 
and lQguri et~ai1 l|2010f ) , but more similar to the various BCG 
to X-ray peak and centroid offsets, or B CG and X-ray to S L 
centre offsets, mentioned above (see also lGeorge et al.ll2012T ). 
However the centre of DM in the aforementioned WL or SL 
analyses is usually determ i ned b y a fit to certain symmetric 
model, e.g jNavarro et all l|l996l . NFW). It is not surprising 
therefore that the results will be somewhat different, as the 
fit of the SL or WL data to an NFW profile, for example, 
is susceptible to the overall shape of the 2D mass distribu- 
tion (and out to larger scales in the WL case); while in our 
smoothing procedure we do not fit to a specific model, and 
explicitly examine the offset between the BCG and high- 
est peak centre of the smooth DM map. Our results show 
that the effect on, say, future WL analyses, of the "real" 
DM-BCG offset, is therefor e expected to be relativ ely small 
(see Eq. 9 and Fi gure 4 in Ijohnston et al.l l2007bl ; see also 
IGeorge et al.|[2012l and references therein), although we leave 
the exact assessment of this effect for relevant future work. 

To establish how significant is the typical offset being 
different than zero, we performed the Student t-test. We ex- 
amine if the (both linear and log) distributions could have 
been drawn from a Gaussian distribution with a mean of 
zero. To do this we first convert the offsets back to one- 
dimensional Gaussian random number by multiplying with a 
random phase. The absolute t-statistics is 0.1(0.19), with an 
error probability of 0.42(0.46), that the linear(logarithmic) 
distribution is drawn from a Gaussian with a zero mean. 
We therefore conclude that although relatively small, the 
typical offset is significantly non-zero. In addition, the ef- 
fective DM centre we adopt here, namely the peak of the 
smoothed light distribution representing the DM, can con- 
stitute a natural and alternative definition of cluster centers 
for optically-selected cluster catalogues. 

The second main result of this work, is that the offset 
distribution shows on average, a steady trend with redshift 
(although given the distribution widths the trend is rendered 
so far insignificant, see Figures [lOl and \TT} . Higher redshift 
clusters show generally larger separations, both in physical 
scales and angular scales, and more mildly, also smaller dis- 
tribution widths. This evolution with redshift, although ten- 
tative, may be related evidently to cluster evolution and re- 
laxation processes, as less relaxed clusters which are younger 
and therefore more often found in higher redshifts, have a 
more spread-out mass distribution and tend correspondingly 
to exhibit larger BCG-DM offsets. In other words, the mat- 
ter in lower-redshift clusters has had more time to fall into 
and form a well-defined potential well. This result is consis- 
tent with brightness centroid shifts, metallicity, brightness 
profile steepness, and other similar relations found in cluster 
evolutio n works (e.g. iMaughan et alj |2008|. and referenc es 
therein; IMann fc Ebelindl2012l . see also lzitrin et al.ll2012fl . 

The origin of the (slightly) smaller distribution widths 
in higher-redshift clusters is not perfectly clear, although 
similarly, it may well be a characteristic for the diversity 
of the cluster population: higher-redshift clusters will be 
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mostly unrelaxed, whereas lower-redshift clusters can be 
more of a mixed population in a somewhat wider variety 
of relaxation states. In addition to our assessment of the 
DM-BCG offsets distribution, the tentative trends seen here 
as a function of redshift are real, the results of this work 
could in turn constitute a complementary and independent 
measure of cluster evolution history, in future studies. 

It should also be mentioned that the results of our work 
are highly coupled to the cluster catalogue in use and its 
inherent uncertainties. Future comparisons with indepen- 
dent catalogues, whose clusters are not necessarily optically- 
selected, should aid in establishing further our results. 
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